Instantaneous Spreading and Einstein Causality in 

Quantum Theory [] 

Gerhard C. Hegerfeldt 

Institut fur Theoretische Physik 
Universitat Gottingen 
Bunsenstr. 9 
37073 Gottingen, Germany 

Abstract 

In nonrelativistic quantum mechanics the wave- function of a free parti- 
cle which initially is in a finite volume immediately spreads to infinity. In 
a nonrelativistic theory this is of no concern, but we show that the same 
instantaneous spreading can occur in relativistic quantum theory and that 
transition probabilities in widely separated systems may instantaneously 
become nonzero. We discuss how this affects Einstein causality. 



1 Introduction 

The principle of the limiting role of the velocity of light is often called 'Einstein 
causality' [T] . Einstein's principle of finite signal velocity is of fundamental im- 
portance for the foundations of physics, both in classical as well as in quantum 
physics. 

Now consider nonrelativistic quantum mechanics and a free particle whose 
wave function at time t — vanishes outside some finite region V. The latter 
implies that the particle is in V with probability 1, and thus with certainty. 
The free nonrelativistic time evolution is then such that at an arbitrarily short 
time later the wave function is nonzero arbitrarily far away from the original 
region V @. Thus the wave function instantaneously spreads to infinity and the 
probability of finding the particle arbitrarily far away from the initial region is 
nonzero for any t > 0. But since this superluminal propagation happens in a 
nonrelativistic theory it is of no great concern. 

Relativistic wave equations which are used to describe free relativistic particles 
are hyperbolic and therefore the propagation is at most luminal so that the above 
problem does not seem to arise. However, if a wave function y?(x, t) satisfies a 
relativistic wave equation then the connection between the variable x and the 
position of the particle is in general not clear-cut. In fact, the position operator 

1 Talk given at the workshop Superluminal (?) Velocities, Cologne, June 6-10, 1998. A talk 
with a similar content was also given by the author at the symposium Uncertain Reality, New 
Delhi, January 5-9, 1998. 



proposed by Newton and Wigner || is not multiplication by x but rather is a 
nonlocal function of x. It was noted by Weidlich and Mitra @] and by Fleming 
H that with the Newton- Wigner position operator superluminal propagation can 
occur in the localization of a free relativistic particle. Amrein || pointed out that 
this happened also for a proposed position operator for the photon. A similar 
phenomenon occurred in some models in which localization was expressed by 
means of a current-density four- vector as shown by Gerlach, Gromes, and Petzold 

In 1974 the present author |§ showed that this phenomenon of instantaneous 
spreading is quite general for a free relativistic particle, irrespective of the partic- 
ular notion of localization, be it in the sense of Newton- Wigner or others. Later 
an alternative proof of this result was given by Skagerstam ||, and it was ex- 



tended to relativistic systems by Perez and Wilde [ 10 1 and to quite general, not 



necessarily relativistic, interactions by Ruijsenaars and the author JTT|. The up- 



shot of Ref. [|TT[] was that this instantaneous spreading is mainly due to positivity 
of the energy plus translation invariance. Neither relativity nor field theory is 
needed. It was recently shown by the author JL2] that translation invariance is 
also not needed. Hilbert space and positivity of the Hamiltonian (energy) suffices 
to ensure either instantaneous spreading or, alternatively, confinement in a fixed 
region for all times. 

A further extension was given by the author [13j for free relativistic particles 
and for relativistic systems which have exponentially bounded tails in their local- 
ization outside some region V. It was shown that the state spreads out to infinity 
faster than allowed by a probability flow with finite propagation speed. 

In the following we give a brief proof of our basic result of Ref. H; for purely 
pedagogical reasons we use relativistic invariance there. We then present an 
analysis of Fermi's two-atom system [14|] where we follow our Refs. JL5, 16| . A 



large portion will be devoted to a discussion what these results mean for Einstein 
causality. Probably the most astonishing fact about our results is that so little 
is needed to derive them. No field theory, no relativity, only Hilbert space and 
positivity of the energy is needed. 



2 Spreading for particles 

If ^t(x) denotes a wave function of a particle in nonrelativistic quantum mechan- 
ics then the probability, P^ t (V), of finding the particle in a region V at time t is 
given by 

P*t(V) = [ M*)\ 2 d 3 x = (A, X vA) (1) 
Jv 

where Xv( x ) is the characteristic function of V and equals 1 in V and out- 
side. Regarded as a multiplication operator, xv is a projector of the spectral 



2 



decomposition of the usual nonrelativistic position operator X. For the Newton- 
Wigner position operators there are analogous projection operators which de- 
termine P^ t (V), but which are no longer given by a simple multiplication by a 
function. 

In this section no position operator will be assumed nor any form of projection 
operators. But rather I will assume just the existence of some operator N(V) such 
that the probability P^ t (V) of finding the particle in V at time t is given by 

P^ t (V) = (^N(V)^ t ) (2) 

where now ip t is the (abstract) state vector of the free relativistic particle. Because 
probabilities lie between and 1 one must have 

< N(V) < 1 . (3) 

This implies in particular that N(V) is hermitian and that N{V) 1 ^ 2 exists as a 
hermitian positive operator. No explicit form of N(V) will be assumed in the 
following. 

The time development of the free particle is described by the generator P° = 
y/P + m 2 for time translation in an irreducible representation of the Poincare 
group for arbitrary mass m > and spin s — 0, |, ... . By 

17(a) = e~ iF - a/h (4) 

we denote the operator for spatial translations. By translation covariance, the 
probability of finding the translated state U(a)ip t in the translated region V a is 
the same as that of finding ip t in V. Replacing ip t by ip' t = U(—a)i[) t the last 
statement implies 

P^(Va) = (ik, U(a)N{V)U{a)%) . (5) 

Note that the r.h.s. of this can be written as a vector norm. We can now prove 
the following results on instantaneous spreading. 

Theorem 1 ||. Consider a free relativistic particle of positive or zero mass 
and arbitrary spin. Assume that at time t — the particle is localized with prob- 
ability 1 in a bounded region V. Then there is a nonzero probability of finding 
the particle arbitrarily far away at any later time. 

Proof: The proof is by contradiction. Let t > be arbitrary but fixed and 
assume that at time t the particle is localized with probability 1 in some large, 
but finite, region containing V. This implies that for all sufficiently large |a|, 
larger than some constant R t say, one has P^ t (V a ) = 0, as indicated in Fig. 1. 
Eq. (5) then becomes 
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Figure 1: Maximal localization region at time t has no intersection with V a . 



||iV(K) 1/2 f/(a)Vt|| 2 = for all |a| > R t . (6) 

Thus the vector inside the norm vanishes. Multiplying the vector by iVfV) 1 / 2 
one obtains 

N(V)U(a)ty t = for all |a| > R t . (7) 
Taking the scalar product of this with ipQ yields 

(AT(V)Vo, U(-a)ip t ) = for |a| > Rt . (8) 
We now introduce the abbreviations 

<f) EE iV(V)V>0 

/ t (x) = (</>,U(-x)A) ■ (9) 

In momentum space P becomes multiplication by p and thus the r.h.s. of Eq. 
(H) can be written as 

/*(*) = / (p2+ ^ 2)1/2 0(p)* ■ ^Me-*^^~ 2t e tp x (10) 



where a summation over spin variables is understood and where m = is allowed. 

Since, by Eq. (|7|), ft(x) vanishes outside the ball of radius R t around the 
origin (which means that ft has compact support) its Fourier transform must 
be analytic, both for t = and the fixed t > under consideration. But it is 
evident from Eq. ( |T0"D that the Fourier transform of ft{x) is the expression over 
the brace. However, because of the root in the exponent this expression cannot 
be analytic in p for two distinct values of t unless ft = 0. For t = and x = 
the latter would imply 

(JV(V)Vo,-0o> = O, 
which is a contradiction. This concludes the proof. 
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It is worthwhile to point out that Theorem 1 carries over to unbounded regions 
V like infinite slabs of the form {(x); \xi\ < d} and that the spreading is indeed 
over all space ('no holes') and relativity is not needed, as shown by the author and 



Ruijsenaars |Tl[. Recently the author |L2[ showed the stronger result that Hilbert 
space and positivity of the energy (Hamiltonian) yield instantaneous spreading 
for systems. 

In Ref. [[HI the author proved a stronger result for exponential tails. We say 
that a particle is localized with exponentially bounded tails if the probability of 
finding it outside a ball B r of radius r around the origin, i.e. in M 3 \B r , decreases 
at least as exp{— Kr} for large r, with some K > m > 0. It was shown that if a 
particle is thus localized at time t = then at any later time t one has, for any 
c > and any r > 0, 

(M 3 \B ro+ct ) > P^ (M 3 \B ro ) . (11) 

The strict inequality sign is the main point here. For relativistic systems the 
same result holds if the tails are bounded Gaussian-like with exp{— K'r 2 }, some 
K 1 > 0. 




Figure 2: Probability flow of speed at most c. 



As seen from Fig. 2, if the change of localization probability with time were 
due to a probability flow of speed at most c, then the probability of finding the 
particle at time t outside the ball B ro+ct could only have grown by the amount 
between the two spheres and could therefore not exceed that at time t = outside 
the ball B ro . Just this is violated in Eq. (|TT|). 

Eq. flllD yields Theorem 1 if one chooses V C B ro and assumes P^ (M 3 \V) = 
0. Then the r.h.s. of Eq. ([11]) is zero while the l.h.s. is nonzero so that the 
probability of finding the particle outside any large ball in nonzero. 



3 The two-atom model of Fermi 



To check finite propagation speed Fermi [l^j supposed in his model that by some 
means one had prepared, at time t = 0, an atom A in an excited state \ca) and 
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some distance R away an atom B in its ground state, with no photons initially 
present. The state of the system then develops in time and Fermi calculated the 
probability to find the state \gA) \qb) |0 p h) at time t. He probably had in mind 
that this only could occur by de-excitation of A accompanied by an emission of 
a photon and then absorption of the latter by B together with an excitation of 
B. However, actually to check that there are no photons requires, at least in 
principle, photon measurements over all space. Hence this 'exchange' probability 
cannot be used for signals, it just refers to statistical correlations. In the spirit 
of Fermi a better approach would be to calculate the probability of finding B 
excited, irrespective of the state of A and possible photons. This approach was 
analyzed by the present author in Refs. |L5|, [L6|. 

If the observable '£> is in an excited state' makes sense and is represented by 
an operator O eg then 



is the probability to find B excited. Again, since probabilities lie between zero 
and 1, one must have 



For example, O eg might be a projection operator, as for bare states, but more 
general possibilities are conceivable. The explicit form of O eb is not needed, only 
Eqs. (|l"2D and (|H|) are used. 

To more clearly separate what is physics and what mathematics the next the- 
orem is phrased in purely mathematical terms. Application to physics will be 
explained subsequently. 

Theorem 2 [fL5| . Let the operator H be self-adjoint and bounded from below. 
Let O be any operator satisfying 



(12) 



< o en < 1 . 



(13) 



< O < const. 



(14) 



Let ip be any vector and define 



ipt = e 



-iHt 



^0 • 



Then one of the following two alternatives holds. 



(i) (ip t , Oip t ) ^ for almost all t 

(and the set of such t's is dense and open) 



(ii) (^,O^)=0 



for all t . 
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The proof can be found in Refs. WR, [L6|. In the application to Fermi's model 



H would be the Hamiltonian, O = O eg and ip would represent the initial physical 
state in which A is excited, B not, and with no photons so that (-^o, O eg ipo) = 0. 
One would then conclude that B is either immediately excited with nonzero 
probability, or never. 



The above mathematical result has been used recently by the author JT2[ to 
show that translation invariance is not needed for instantaneous spreading of par- 
ticles or system. The Hamiltonian can be quite general, only boundedness from 
below is required, and this ensures either instantaneous spreading or confinement 
in a fixed bounded region for all times, corresponding to the alternatives (i) and 
(ii) above. 



4 Spreading in relativistic quantum mechanics 

At first sight the Dirac equation might seem to be a counterexample to our results 
on instantaneous spreading. Indeed, this wave equation is hyperbolic, implying 
finite propagation speed. For the localization operator N(V) one might take the 
characteristic function xv{&), just as in the nonrelativistic case of Eq. (JT]) and in 
contrast to the (nonlocal) Newton- Wigner operator. Then, for a wave function 
initially vanishing outside a finite region, i.e. of finite support, the localization 
does evolve with finite propagation speed! Doesn't this contradict Theorem 1 
above? 

This example is instructive since it shows the importance of the positive- 
energy condition. The Dirac equation contains positive and negative energy 
states, and therefore we conclude from our results that positive-energy solutions 
of the Dirac equation always have infinite support to begin with! This is phrased 
as a mathematical result for instance in the book of Thaller 11171 . 



If there are no strictly localized states in the theory then Theorem 1 does not 
apply! Strict localization of a state if) in a region V means that (if), N(V)if)) = 1, 
and this gives 

0=^,(l-N(V))iP) = Ul-N(V)) 1 / 2 iP\\ 2 
where the root exists by Eq. (|3|). Similar to Eqs. (|6|) and (|7J) this implies 

N{V)if) = if). (15) 

Hence if> is an eigenvector of N(V) for the eigenvalue 1 if if> is strictly localized 
in V, and vice versa. The eigenvalue means strict localization outside V. 

The existence or nonexistence of strictly localized states depends on the form 
of N(V). For example, if one has a self-adjoint position operator X with com- 
muting components, then N(V) is a projection operator from the spectral de- 
composition of X and thus has eigenvalues 1 and 0. Hence in this case there are 
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strictly localized states for any region V, and Theorem 1 implies instantaneous 
spreading. 

This instantaneous spreading also occurs for position operators with self- 
adjoint but non-commuting components JQ. Each JQ has a spectral decomposi- 
tion whose projection operators give the localization operators for infinite slabs. 
Eigenvectors for the eigenvalue 1 represent states strictly localized in these slabs, 
and by the remark after Theorem 1 there is instantaneous spreading in this case, 
too. 

To avoid instantaneous spreading one has therefore to consider localization 
operators N(V) which are not projectors, for example positive operator-valued 
measures. However, if one insists on arbitrary good localization, i.e. on tails 
which drop off arbitrarily fast, then one runs into Eq. (|TT|). This equation can 
be interpreted as stemming from an infinitely fast probability flow. 

If in the Fermi model atoms A and B would develop instantaneous tails then 
an immediate excitation would not seem surprising. So the two phenomena ap- 
pear to be connected. Mathematically this connection is of course given by the 
assumed positivity of the energy and has been discussed in detail by the author 



in Ref. 12 



Could instantaneous spreading be used for the transmission of signals if it 
occurred in the framework of relativistic one-particle quantum mechanics? Let 
us suppose that at time t — one could prepare an ensemble of strictly localized 
(non-interacting) particles by laboratory means, e.g. photons in an oven. Then 
one could open a window and would observe some of them at time t = e later 
on the moon. Or to better proceed by repetition, suppose one could successively 
prepare strictly localized individual particles in the laboratory. Preferably this 
should be done with different, distinguishable, particles in order to be sure when 
a detected particle was originally released. Such a signaling procedure would 
have very low efficiency but still could be used for synchronization of clocks or, 
for instance, for betting purposes. 



5 Field-theoretic aspects and discussion 

Localization of particles. In field theory difficulties with particle localization have 
been known for a long time However, our results are more general since 

no fields are involved and none of the particular assumptions or axioms of field 
theory, except for positivity of the energy, are used. 

In a field-theoretic context permanent infinite tails can be understood in- 
tuitively through clouds of virtual particles due to renormalization ('dressed 
states'). It is also conceivable that whenever one tries to prepare a localized 
particle or system one automatically creates particle-antiparticle pairs outside 
the desired localization region, and this would have the same effect as tails. 

Instead of speaking about infinite tails one may also envisage that all particle 
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detectors exhibit, for example, inherent noise and that therefore localization with 
probability 1 or zero can never be recorded. This would essentially lead to the 
same conclusions as permanent infinite tails. 

Fermi's model. From the foregoing discussion it is evident that, in a field- 
theoretic context, excitation of atom B need not be due to absorption of a photon 
emitted by atom A. The excitation could rather be due to vacuum fluctuations, 
photon clouds etc. Or the excitation maybe just spontaneous, whatever that 
means. At a more fundamental level, the notion of bounds states has its intricacies 
in field theory, and the corresponding observables in the Fermi model might be 
hard to put on a mathematically rigorous footing. 

If infinite tails always exist, or if all counters are influenced by vacuum fluc- 
tuation, then how can finite propagation speed or Einstein causality be checked 
at all? Here it is useful for differentiate between two notions of causality. 

Strong causality. By this we mean that for each individual experiment in 
which two systems, separated by a distance R, are prepared at time t = no dis- 
turbance or excitation of the second system occurs for t < R/c |16|| . This notion 
is analogous to energy momentum and baryon conservation in each individual 
scattering process in particle physics. Strong causality would hold in the Fermi 
model if the transition probability were strictly zero for t < R/c. It seems that 
Fermi had this causality notion in mind. Our results show that strong causality 
cannot be checked (unless a possible way out via cut-off theories holds [0), or it 
may fail in a theory. 

Weak causality. This notion was introduced by Schlieder [F2(J, and it means 
Einstein causality for expectation values or ensemble averages only, not for each 
individual process. Thus for the above two systems, expectation values for the 
second system need not vanish for t < R/c, but it would take at least a time 
t = R/c to produce an effect on them. To exhibit this effect it is convenient 
to subtract possible fluctuations of the second system alone. Theorem 2 does 
not apply to this situation since this difference is not the expectation value of a 
positive operator. 

The weak assumptions of the above theorems (just Hilbert space and positive 



energy) will not be enough to prove weak causality. In Refs. [51, ^] the Fermi 
model has been studied using the methods and approximations of quantum optics. 
Vacuum fluctuations and virtual photons contribute to the excitation of atom B, 
and once the expectation value of this contribution has been subtracted, the 
remainder behaves causally, at least within the approximations employed. This 
just corresponds to the notion of weak causality. In how far this can be measured 
will be discussed below. 

In the framework of quantum field theory it is sometimes simply argued that 
local commutation or anti-commutation relations must clearly ensure causal be- 
havior, for instance of localized particles. This implicitly presupposes, however, 
that the relevant operators, e.g. N(V) or O eg -if they exist - are local functions 
of the fields. It should be recalled that fields do not enter in our formulation, 
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and it should be noted that if N(V) is a local function of the fields then the 
Reeh-Schlieder theorem [1{| implies that its vacuum expectation is nonzero. 

Within local quantum field theory a rigorous proof of weak causality for local 
observables has been given by Schlieder [20] and by Buchholz and Yngvason [23] . 
as well as by Neumann and Werner in an alternative algebraic framework. 
In Ref. |23| it was moreover shown that restrictions of states to local algebras 
cannot be tested by means of transition probabilities. 

How can one check weak causality experimentally? In the Fermi model one 
would not use a single pair of atoms but rather an ensemble, either by repetition 
or by simultaneous realization. If in the Fermi case one has N pairs of atoms A 
and B one would measure at time t how many B atoms are excited. For iV — > oo 
their fraction would be given by (ip t , O eB ip t ), while for finite N this would hold 
only approximately, due to statistical fluctuations. Then one would subtract the 
- either calculated or measured - excitation probability of B without A present. 
Weak causality asserts that the difference should be zero for t < R/c - but only 
for N — > oo, while for finite N there are always fluctuations. Hence in a strict 
sense, weak Einstein causality can only be checked experimentally for infinite 
ensembles. This suggests a macroscopic context. 



The main point of our results seems to be that instantaneous spreading holds 
already under amazingly few assumptions. Neither the existence of fields nor 
the usual axioms of field theory are assumed - the only input is Hilbert space 
and positivity of the energy. Our results seem to indicate the need for a mech- 
anism like vacuum fluctuations, clouds of virtual particles, particle-antiparticle 
pairs, spontaneous excitations, or something like that in order to retain Einstein 
causality. Our results are compatible with quantum field theory which uses much 
stronger assumptions and in which vacuum fluctuations etc. are present. 
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